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QJ Abstract 

Q In this paper, we settle the open complexity status of interval constrained coloring with a fixed 

t^. number of colors. We prove that the problem is already NP-complete if the number of different 

colors is 3. Previously, it has only been known that it is NP-complete, if the number of colors is 
part of the input and that the problem is solvable in polynomial time, if the number of colors is at 
most 2. We also show that it is hard to satisfy almost all of the constraints for a feasible instance. 
This implies APX-hardness of maximizing the number of simultaneously satisfiable intervals. 

1 Introduction 

In the interval constrained 3-coloring problem, we are given a set 1 of intervals defined on [n] := 
{1, . . . , n} and a requirement function r : X — > Z> , which maps each interval to a triple of non- 
negative integers. The objective is to determine a coloring x : [n] — > {1, 2, 3} such that each interval 
gets the proper colors as specified by the requirements, i.e. e xW = r (^) where ei, e2, e% are the 
three unit vectors of Z 3 . 

This problem is motivated by an application in biochemistry to investigate the tertiary structure 
of proteins as shown in the following illustration. More precisely, in Hydrogen-Deuterium-Exchange 
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Figure 1: Coloring of the residues of a protein chain according to their exchange rates. 

(HDX) experiments proteins are put into a solvent of heavy water (D2O) for a certain time after 
which the amount of residual hydrogen atoms, that have exchanged with deuterium atoms, is mea- 
sured [lj. Doing this experiment for several timesteps, one can determine the exchange rate of the 
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residues. These exchange rates indicate the solvent accessibility of the residues and hence they pro- 
vide information about the spatial structure of the protein. Mass spectroscopy is one of the methods 
for measuring these exchange rates. To this end, the proteins are digested, i.e. cut into parts which 
can be considered as intervals of the protein chain, and the mass uptake of each interval is measured. 
But thereby only bulk information about each interval can be obtained. Since there is not only one 
protein in the solvent but millions and they are not always cut in the same manner, we have this bulk 
information on overlapping fragments. That is, we are given the number of slow, medium, and fast ex- 
changing residues for each of these intervals and our goal is to find a feasible assignment of these three 
exchange rates to residues such that for each interval the numbers match with the bulk information. 

Though the interval constrained 3-coloring problem is motivated by a particular application, its 
mathematical abstraction appears quite simple and ostensibly more general. In terms of integer linear 
programming, the problem can be equivalently formulated as follows. Given a matrix A E {0, l} mXn 
with the row-wise consecutive-ones property and three vectors 61,2,3 £ 2<>o> the constraints 
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have a binary solution, i.e. £1,2,3 <E {0, l} n , if and only if the corresponding interval constrained 3- 
coloring problem has a feasible solution. We may assume w.l.o.g. that the requirements are consistent 
with the interval lengths, i.e. A ■ 1 = b\ + 62 + b%, since otherwise we can easily reject the instance as 
infeasible. Hence, we could treat 23 as slack variables and reformulate the constraints as 



Ax\ — 61, AX2 — 62, xi + X2 < 1. 



(2) 



It is known that if the matrix A has the column-wise consecutive ones property (instead of row-wise), 
then there is a reduction from the two-commodity integral flow problem, which has been proven to 
be NP-complete in Q. However, the NP-completeness w.r.t. row- wise consecutive ones matrices has 
been an open problem in a series of papers as outlined in the following subsection. 



1.1 Related Work 

The problem of assigning exchange rates to single residues has first been considered in [3]. In that 
paper, the authors presented a branch-and-bound framework for solving the corresponding coloring 
problem with k color classes. They showed that there is a combinatorial polynomial time algorithm 
for the case of k = 2. Moreover, they asked the question about the complexity for A: > 2. In 01, 
the problem has been called interval constrained coloring. It has been shown that the problem is NP- 
hard if the parameter k is part of the input. Moreover, approximation algorithms have been presented 
that allow violations of the requirements. That is, a quasi-polynomial time algorithm that computes a 
solution in which all constraints are (1 + e) -satisfied and a polynomial time rounding scheme, which 
satisfies every requirement within ±1, based on a technique introduced in 0. The latter implies that 
if the LP relaxation of ([]} is feasible, then there is a coloring satisfying at least ^ of the requirements. 
APX-hardness of finding the maximum number of simultaneously satisfiable intervals has been shown 
in for k > 2 provided that intervals may be counted with multiplicities. But still, the question 
about the complexity of the decision problem for fixed k > 3 has been left open. In |7|, several 
fixed parameter tractability results have been given. However, the authors state that they do not know 
whether the problem is tractable for fixed k. 
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1.2 Our contribution 



In this paper, we prove the hardness of the interval constrained fc-coloring problem for fixed parameter 
k. In fact, we completely settle the complexity status of the problem, since we show that already 
the interval constrained 3-coloring problem is NP-hard by a reduction from 3-SAT. This hardness 
result holds more generally for any problem that can be formulated like ([l}. Moreover, we even 
show the stronger result, that it is still difficult to satisfy almost all of the constraints for a feasible 
instance. More precisely, we prove that there is a constant e > such that it is NP-hard to distinguish 
between instances where all constraints can be satisfied and those where only a (1 — e) fraction of 
constraints can be simultaneously satisfied. To this end, we extend our reduction using expander 
graphs. This gap hardness result implies APX-hardness of the problem of maximizing the number of 
satisfied constraints. It is important to note that our construction does neither rely on multiple copies 
of intervals nor on inconsistent requirements for an interval, i.e. in our construction for every interval 
(z, j) we have unique requirements that sum up to the length of the interval. 



2 NP-hardness 

Theorem 1. It is NP-hard to decide whether there exists a feasible coloring \ f° r & n instance (X, r) of 
the interval constrained 3-coloring problem. 

Proof. The proof is by reduction from the 3-SAT problem. 

Suppose to be given an instance of the 3-SAT problem, defined by q clauses Ci, . . . , C q and p variables 
#1, . . . , x p . Each clause Ci (i = 1, . . . , q) contains 3 literals, namely 2/2(^)3 2/3 W- Each literal 
VhiP) (i = 1? • • • 5 Q an d h = 1, 2, 3) refers to a variable Xj, that means, it is equal to either xj or xj 
for some j in 1, . . . A truth assignment for the variables xi,...,x p satisfies the 3-SAT instance if 
and only if, for each clause, at least one literal takes the value true. 
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Figure 2: The sequence of nodes in an instance of the interval constrained 3-coloring problem. 

We now construct an instance of the interval constrained 3-coloring problem. For each clause 
Ci we introduce a sequence of consecutive nodes. This sequence is, in its turn, the union of three 
subsequences, one for each of the three literals (see Fig. [2]). 
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In the following, for the clarity of presentation, we drop the index i, if it is clear from the context. 
We denote color 1 by RED, color 2 by BLACK and color 3 by WHITE. 



Literal yi(i). The subsequence representing literal y\ is composed of 8 nodes. Among them, there 
are three special nodes, namely £i, f\ and a\, that play a key role since they encode the information 
about the truth value of the literal and of the variable xj it refers to. The basic idea is to achieve the 
following two goals: 1) given a feasible coloring, if x(h) is BLACK, we want to be able to construct 
a truth assignment setting xj to true, while if x(fi) * s BLACK, we want to be able to construct a truth 
assignment setting the variable xj to false; 2) given a feasible coloring, if x( a i) is RED, we want to 
be able to construct a truth assignment where y\ is true. 

To achieve the first goal, we will impose the following property: 

Property 2. In any feasible coloring, exactly one among t\ and fi will be BLACK. 

To achieve the second goal, and being consistent with the first one, we must have the property that: 

Property 3. In any feasible coloring, if x( a i) = RED, then x(h) = BLACK if y\ = xj, while 
x(fi) = BLACK ifyi = xj. 

To guarantee properties ^ and ([3]), we introduce a suitable set I(yi) of six interval^] shown in 
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Figure 3: Literal y\. The picture on the right shows the three feasible colorings. On a black and white 
printout red color appears as grey. 

The requirement function for such intervals changes whether y\ = xj or y\ = xj. If y\ = xj, 
weletr(Ji) = (1, 1, 1); r(/ 2 ) = (1, 1, 1); r(/ 3 ) = (1,0, 1); r(/ 4 ) = (1, 1,2); r(/ 5 ) = (0,1,0); 
t(Iq) = (2, 3, 3). For any feasible coloring there are only three possible outcomes for such sequence, 
reported in Fig. [3J3. Observe that the properties ^ and ^ are enforced. 

Now suppose that y\ = xj : then we switch the requirement function with respect to WHITE and 
BLACK, i.e. define it as follows: r(h) = (1,1,1); r(J 2 ) = (1,1,1); r(I 3 ) = (1,1,0); r(h) = 

1 ln principle, interval h and the node it contains are not needed. However, this allows to have the same number of 
WHITE and BLACK colored nodes for the sake of exposition. 
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(1, 2, 1); r(Is) = (0, 0, 1); t(Iq) = (2, 3, 3). Trivially, the possible outcomes for such sequence are 
exactly the ones in Fig. [S]} but exchanging the BLACK and WHITE colors. 



Literal y?>{i). The sequence of nodes representing literal y% is similar to the one representing y\. 
We still have a sequence of 8 nodes, and three special nodes £3, and a%. As before, we let £3 and 
encode the truth value of the variable Xj that is referred to by ys, while as encodes the truth value of 
the literal ys itself. Therefore, we introduce a set T(ys) of intervals in order to enforce the following 
properties: 

Property 4. In any feasible coloring, exactly one among £3 and fs will receive color BLACK. 

Property 5. In any feasible coloring, if x( a s) = RED, then xih) = BLACK if y% = xj, while 
X (fs) = BLACK if y 3 = Xj. 

Fig. [4^ shows the nodes and the six intervals that belong to T{ys)'. observe that the sequence is similar 
to the one representing y\, but the position of node a<$ and the intervals are now "mirrored". If y% = xj, 
weletr(Ji) = (1, 1, 1); r(/ 2 ) = (1, 1, 1); r(/ 3 ) = (1,0, 1); r(I 4 ) = (1, 1, 2); r(/ 5 ) = (0,1,0); 
r(Io) = (2, 3, 3). Fig.|4|3 reports the three possible outcomes for such sequence in a feasible coloring. 
Note that properties ([4]) and ([5]) hold. 

Now suppose that y% = xf once again, we switch the requirement function with respect to 
WHITE and BLACK. 
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Figure 4: Literal y$ 



Literal y<± {%). The sequence of nodes representing literal y<± is slightly more complicated. It is 
composed of 36 nodes, and among them there are 4 special nodes, namely £25/2, a 2 anc ^ a 2 ( see 
Fig. [5]). Still, we let £2 and S2 encode the truth value of the variable xj that is referred to by 2/2 > while 
a^ and a\ encode the truth value of the literal. 

Similarly to the previous cases, we want to achieve the following goals: 1) given a feasible color- 
ing, if x(h) is BLACK, we want to be able to construct a truth assignment setting the variable xj to 
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Figure 5: Literal y 2 



true, while if x(h) is BLACK, we want to be able to construct a truth assignment setting the variable 
Xj to false; 2) given a feasible coloring, if x( a i) = x( a 2 ) = RED, we want to be able to construct a 
truth assignment where the literal y 2 is true. We are therefore interested in the following properties: 

Property 6. In any feasible coloring, exactly one among t 2 and fa will receive color BLACK. 

Property 7. In any feasible coloring, if x(a|) = RED and x( a 2 ) = RED, then x{t 2 ) = BLACK 
ify 2 = xj, and x(fa) = BLACK ify 2 = xj. 

In this case, we introduce a set 1(2/2) of 14 suitable intervals, shown in Fig. [5] The requirements for 
the case y 2 = xj are given in the following table. 
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Observe that the set of intervals {1\ , . . . , 1$} is defined exactly as the set T(yz), therefore the possible 
outcomes for the sequence of 8 nodes covered by such intervals are as in Fig. (4]}. Similarly, the set 
of intervals {^7, . . . , I12} is defined exactly as the set X(yi), therefore the possible outcomes for the 
sequence of 8 nodes covered by such intervals are as in Fig. ^jp. Combining t(Iq) and r(Ii 2 ) with 
r(/i4), it follows that in any feasible coloring x, exactly one node among ^2 and /2 has WHITE (resp. 
BLACK) color, enforcing Property @. Still, note that if x(<4) = and xK) = then 

both the leftmost node and the rightmost node covered by interval /13 have color BLACK, therefore 
t 2 must have color WHITE otherwise r(Iis) is violated. Together with Property Q, this enforces 
Property ([7]). 

In case y 2 = xj, once again we switch the requirement function with respect to WHITE and 
BLACK. 

It remains to describe the role played by the first 13 nodes and the last 15 nodes of the sequence, that 
so far we did not consider. We are going to do it in the next paragraph. 

Intervals encoding truth values of literals. For each clause Q, we add another set X(Ci) of inter- 
vals, in order to link the nodes encoding the truth values of its three literals. The main goal we pursue 
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is the following: given a feasible coloring, we want to be able to construct a truth assignment such 
that at least one of the three literals is true. To achieve this, already having properties ([3]), ([5]) and ([7]), 
we only need the following property: 

Property 8. For any feasible coloring, if x( a i) RED andx{a?>) ^ RED , then x( a i) = x( a 2) = 
RED. 

Fig. [6] shows the six intervals that belong to T(Ci). The requirement function is: r{I\) = (1, 2, 2); 
r(J 2 ) = (1,2,2); r(J 3 ) = (1, 6, 6); r(/ 4 ) = (1,3,3); r(J 5 ) = (1,2,2); r(J 6 ) = (1,7,7). We now 
show that Property §8§ holds. Suppose % is a feasible coloring, and let ui, . . . , vis be the first 13 nodes 
of the sequence introduced for literal 2/2 • By construction, if x( a i) 7^ RED, then there is a node 
Vj : x(^j) = ^RS 1 ^ and j E {1,2,3} , otherwise r(ii) is violated. Similarly, if x( a 2) 7^ RED, 
then there is a node ^ : x( v j) = i^E-D and j E {11, 12, 13} , otherwise r(/ 2 ) is violated. On the 
other hand, this subsequence contains exactly one node with RED color, otherwise r(Is) is violated. 
It follows that at least one among a\ and has RED color. The same conclusions can be stated for 
nodes a r 2 and a%. Putting all together, it follows that the Property ^ holds. 
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Figure 6: Set of intervals T(Ci). 



Intervals encoding truth value of variables (later also called: variable intervals). Our last set of 
intervals will force different nodes to take the same color, if they encode the truth value of the same 
variable. In particular, we aim at having the following property: 

Property 9. In any feasible coloring, x(t^(i)) = x(t^(i ; )) if both literals yh(i) and yk(i f ) refer to the 
same variable xj . 

To achieve this, for each pair of such literals we add a big interval /(y^(z), yk{i r )) from fk(i') to th(i) 
(assuming i' < i without loss of generality). Note that, by construction, there is a subset of intervals 
that partitions all the internal nodes covered by the interval. That means, we know exactly the number 
of such nodes that must be colored with color RED, BLACK and WHITE (say z\ , Z2 , z% respectively). 
Then, we let the requirement function be r(I(yh(i), yk{i'))) = (^1,^2 + 1,^3 + 1). Under these 
assumptions, if % is a feasible coloring then x(th(i)) 7^ x(A(^))> an ^ i n particular one node will 
have WHITE color and the other one BLACK color. Combining this with properties Q,(|4]) and Q, 
the result follows. 

Notice that such an interval constrained 3 -coloring instance can clearly be constructed in polyno- 
mial time. Now we discuss the following claim in more details. 
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Claim 10. There exists a truth assignment satisfying the 3-SAT instance if and only if there exists a 
feasible coloring x for the interval constrained 3-coloring instance. 

First, suppose there exists a feasible coloring. We construct a truth assignment as follows. We 
set a variable Xj to true if x(th(i)) = BLACK, and to false otherwise, where yn(i) is any literal 
referring to xj. Note that, by Property ([9]), the resulting truth value does not depend on the literal we 
take. Still, combining Property ([8]) with properties ([3]), ([5]) and ([7]), we conclude that, for each clause, at 
least one literal will be true. By construction, we therefore end up with a truth assignment satisfying 
the 3-SAT instance. The result follows. 

Now suppose that there is a truth assignment satisfying the 3-SAT instance. The basic idea, is to 
construct a coloring x such that the following property holds for all literals: 

Property 11. x(th(i)) = BLACK (resp. WHITE) if and only if yh{i) refers to a true-variable (resp. 
false-variable). 

Consider the sequence of nodes representing literal 2/1(2), and suppose 2/1(2) = xj for some j. We 
color such nodes as in Fig. [S^-fiJ if the literal is true in the truth assignment, and as in Fig. [S^-fJJ 
otherwise. If 2/1(2) = xj, switch BLACK and WHITE colors, in both previous cases. Now focus on 
the sequence of nodes representing literal 2/3(2). If 2/3(2) = xj for some j, we color such nodes as in 
Fig. [4j)-(7,) if the literal is true, and as in Fig. j^-fJJ otherwise. If 2/3(2) = xj, switch BLACK and 
WHITE colors, in both previous cases. Finally, consider the sequence of nodes representing literal 
2/2(2)- Suppose 2/2(2) = xj. We color the 18 nodes in the middle of the sequence as in Fig. ^(1) 
if 1/2(2) is true, as in Fig. ^(2) if both 2/2(2) and 2/1(2) are false, and as in Fig. ^(3) otherwise. 
Once again, if 2/2(2) = xj, we switch BLACK and WHITE colors, in all the previous three cases. 
Notice that, by construction, Property ( pTj ) holds, and all requirements for the intervals in l(yh(i)) 
(i = 1, . . . , q and h = 1, 2, 3) are not violated. 
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Figure 7: Coloring of nodes representing literal 2/2 

Now we show how to color the first 13 nodes (vi, . . . , ^13) and the last 15 nodes (w\, . . . , ^15) of 
the sequence representing literal 2/2(2), in such a way that the requirements of the intervals ii, . . . , Iq 
in T(Ci) are not violated (i = 1, . . . , q). Note that, by construction, at least one node among a\ and 
a^ is colored with RED. In fact, if 2/1(2) is true then x( a i) = RED, while if 2/1(2) = false then 
a^ is colored with RED. Similarly, at least one node among as and a r 2 is colored with RED, since 
x( a 2) 7^ RED only if both literals 2/1(2) and 2/2(2) are false: then, necessarily 2/3(2) is true, and 
therefore x( a s) = RED. Let us focus on the nodes . . . , ^13, and let u be the node in between ^13 
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and a\. In the following, we refer to WHITE as the opposite color of BLACK and vice versa. As we 
already discuss, we can have only two cases: 

Case 1: x( a i) = x( a i) = RED. We color v\ with the opposite color of fi, and the nodes V2 
and v% with BLACK and WHITE. Note that r(I\) is not violated. We then color v±, ^5, vq with the 
opposite color of v\, ^2, ^3 respectively. Similarly, we color with the opposite color of u. Then, 
we color v\2 and vn with BLACK and WHITE, so that rfa) is not violated. Once again, we assign 
to i?io, vq, ^8 the opposite color of ^13, ^12, vu respectively. Finally, we let x( v 7) = RED. Note that 
r(Is) is not violated. 

Case 2: x( a i) 7^ RED and x( a i) = RED, or vice versa. Suppose x( a i) 7^ (the other 

case is similar). Both nodes a± and fi can have only BLACK or WHITE colors. Then, we can color 
vi and V2 with the opposite color of a\ and fi respectively, and ^3 with color RED, so that r(ii) is 
not violated. Still, we color V4 and v$ with the opposite color of v\ and V2. Finally, we color vq and V7 
with BLACK and WHITE. To the remaining nodes ug, . . . , ^13 we assign the same colors as in Case 
1. One checks that requirements of intervals I2 and ^3 are not violated. 

One can prove in a similar manner that nodes (wi, . . . , wis) can be properly colored, without 
violating the requirements of intervals I^^I^^Iq. 

Finally, since Property ( fTTj ) holds, it is easy to see that, for each couple of literals ^(i), ^(i 7 ), the 
requirement r(J(y/ l (i), yk(i f ))) is also not violated. The result then follows. 

□ 

3 Gap hardness 

We will now argue that not only the interval constrained 3 -coloring problem but also its gap version 
is NP-hard, i.e., it is hard to distinguish between satisfiable instances and those where only up to a 
(1 — e) fraction of constraints may be simultaneously satisfied. 

For the purpose of our argument we will use the following, rather restricted, definition of gap 
hardness. We will only talk about maximization versions of constraint satisfaction problems. Think 
of an instance of the problem as being equipped with an additional parameter t called threshold. We 
ask for a polynomial time algorithm which given the instance answers: 

• "YES" if all the constraints can be satisfied, 

• "NO" if there is no solution satisfying more than t constraints. 

Note that for instances, where more than t but not all constraints can be simultaneously satisfied, any 
answer is acceptable. We will now restrict our attention to the case where the threshold is a fixed 
fraction of the total amount of constraints in the instance. We call problem A to be gap NP-hard if 
there exists a positive e such that there is no polynomial time algorithm to separate feasible instances 
from those where only at most a (1 — e) fraction of the constraint can be simultaneously satisfied 
unless P = NP. 

Observe that gap NP-hardness implies APX-hardness, but not vice versa. For example the linear 
ordering problem (also known as max-subdag) is APX-hard |8 1, but is not gap NP-hard, since feasible 
instances may be found by topological sorting. 

Let us first note that the 3 -SAT problem, which we used in the reduction from the previous section, 
has the gap hardness property. It is the essence of the famous PCP theorems that problems with such 
gap hardness exist. For a proof of the gap hardness of 3-SAT see (9). 
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Before we show how to modify our reduction to prove gap hardness of the interval constraint 
coloring problem, we need to introduce the notion of expander graphs. For brevity we will only give 
the following extract from El . 

Definition 12. Let G = {V, E) be a d-regular graph. Let E(S, S) = | (S x S) fl E\ equal the number 
of edges from a subset S C V to its complement. The edge expansion of G is defined as 

h(G) mm — r— — . 

S:\S\<\V\/2 \S\ 

Lemma 13. There exists do E Z and ho > 0, such that there is a polynomial-time constructible 
family {X n } ne z of do -regular graphs X n on n vertices with h(X n ) > ho. (Such graphs are called 
expanders). 

Let us now give a "gap preserving" reduction from gap 3-SAT to gap interval constrained 3- 
coloring. Consider the reduction from the previous section. Observe that the amount of intervals in 
each literal gadget, and therefore also in each clause gadget, is constant. The remaining intervals are 
the variable intervals. While it is sufficient for the NP-hardness proof to connect occurrences of the 
same variable in a "clique" fashion with variable intervals, it produces a potentially quadratic number 
of intervals. Alternatively, one could connect these occurrences in a "path" fashion, but it would give 
too little connectivity for the gap reduction. The path-like connection has the desired property of using 
only linear amount of intervals, since each occurrence of a variable is linked with at most two other 
ones. We aim at providing more connectivity while not increasing the amount of intervals too much. 
A perfect tool to achieve this goal is a family of expander graphs. 

Consider the instance of the interval coloring problem obtained by the reduction from the previous 
section, but without any variable intervals yet. Consider literal gadgets corresponding to occurrences 
of a particular variable x. Think of these occurrences as of vertices of a graph G. Take an expander 
graph and connect two occurrences of x if the corresponding vertices in the expander are 

connected. For each such connection use a pair of intervals. These intervals should be the original 
variable interval and an interval that is one element shorter on each of the sides. We will call this pair 
of intervals a variable link. Repeat this procedure for each of the variables. 

Observe that the number of variable links that we added is linear since all the used expander 
graphs are do -regular. By contrast to the simple path-like connection, we now have the property, that 
different occurrences of the same variable have high edge connectivity. This can be turned into high 
penalty for inconsistent valuations of literals in an imperfect solution. 

Theorem 14. Constrained interval 3-coloring is gap NP-hard. 

Proof. We will argue that the above described reduction is a gap-preserving reduction from the gap 
3-SAT problem to the gap interval 3-coloring problem. We need to prove that there exists a positive e 
such that feasible instances are hard to separate from those less than (1 — e) satisfiable. 

Let eo be the constant in the gap hardness of gap 3-SAT. We need to show two properties: that the 
"yes" instances of the gap 3-SAT problem are mapped to "YES" instances of our problem, and also 
that the "NO" instances are mapped to "NO" instances. 

The first property is simple, already in the NP-hardness proof in the previous section it was shown 
that feasible instances are mapped by our reduction into feasible ones. To show the second property, 
we will take the reverse direction and argue that an almost feasible solution to the coloring instance 
can be transformed into an almost feasible solution to the SAT instance. 
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Suppose we are given a coloring x that violates at most e fraction of the constraints. Suppose the 
original 3 -SAT instance has q clauses, then our interval coloring instance has at most c • q intervals for 
some constant c. The number of unsatisfied intervals in the coloring \ * s then at most eqc. 

We will say that a clause is broken if at least one of the intervals encoding it is not satisfied by x- 
We will say that a variable link is broken if one of its intervals is not satisfied or one of the clauses it 
connects is broken. An unsatisfied variable link interval contributes a single broken link; an unsatisfied 
interval within a clause breaks at most 3g?o intervals connected to the clause. Therefore, there is at 
most 3doeqc broken variable links in total. 

Recall that each variable link that is not broken connects occurrences of the same variable in two 
different not broken clauses. Moreover, by the construction of the variable link, these two occurrences 
display the same logical value of the variable. 

Consider the truth assignment (p obtained as follows. For each variable consider its occurrences 
in the not broken clauses. Each occurrence associates a logical value to the variable. Take for this 
variable the value that is displayed in the bigger set of not broken clauses, break ties arbitrarily. 

We will now argue, that <p> satisfies a big fraction of clauses. Call a clause bad if it is not broken, 
but it contains a literal such that in the coloring \ this literal was active, but (p evaluates this literal to 
false. Observe that if a clause is neither broken nor bad, then it is satisfied by <p. It remains to bound 
the amount of bad clauses. 

Consider the clauses that become bad from the choice of a value that (p assigns to a particular 
variable x. Let b x be the number of such clauses. By the connectivity property of expanders, the 
amount of variable links connecting these occurrences of x with other occurrences is at least h$b x . As 
we observed above, all these variable links are broken. Since there are in total at most 3doeqc broken 
links, we obtain that there is at most j^d^eqc bad clauses. Hence, there are at most (^o?o + l)eqc 
clauses that are either bad or broken and they cover all the clauses not satisfied by <p. 

It remains to fix e = j^fh^Jc 60 t0 °htain the property, that more than eo unsatisfiable instances 
of 3-SAT are mapped to more than e unsatisfiable instances of the constrained interval 3-coloring 
problem. □ 
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